PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG044944t1
Common NameTCM_044944
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HD-ZIP
Protein Properties Length: 843aa    MW: 93980.5 Da    PI: 6.9085
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG044944t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox642.2e-202479156
                      TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                      +++ +++t++q+++Le++F+++++p++++r +L+++lgL  rq+k+WFqNrR+++k
  Thecc1EG044944t1 24 KKRYHRHTAHQIQRLEAMFKECPHPDEKQRLQLSRELGLAPRQIKFWFQNRRTQMK 79
                      678899***********************************************998 PP

2START176.41.8e-552244452206
                       HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEE CS
             START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kae 80 
                       +a  a++el+++ + +ep+W ks     + +n + + ++f+++++       ++ea+r+sgvv+m+   lv  ++d++ +W   ++     a+
  Thecc1EG044944t1 224 IAANAMEELLRLLQTNEPLWIKSTndgrDVLNLESYERIFPRANThfkspnVRIEASRDSGVVIMNGLALVDMFMDSN-KWVDLFPtivsMAK 315
                       67889*******************99999999999*****99999*****99**************************.999999999989** PP

                       EEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEE CS
             START  81 tlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtw 166
                       t+e issg      g lqlm+ elq+lsplvp R+f+++R+++q + g w+iv +S d +q+ ++     R+++lpSg+li++++ng+skvtw
  Thecc1EG044944t1 316 TIEIISSGmmgshsGSLQLMYEELQVLSPLVPtREFYILRFCQQIEPGLWAIVNISYDFPQFASQ----CRSHRLPSGCLIQDMPNGYSKVTW 404
                       **************************************************************987....************************ PP

                       EE-EE--SSXX.HHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
             START 167 vehvdlkgrlp.hwllrslvksglaegaktwvatlqrqcek 206
                       +ehv+ ++++p h l+r l++sgla+ga +w+atlqr ce+
  Thecc1EG044944t1 405 MEHVEIEDKTPtHRLYRDLIHSGLAFGAERWLATLQRMCER 445
                       ***************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466897.52E-21881IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.7E-231183IPR009057Homeodomain-like
PROSITE profilePS5007118.0742181IPR001356Homeobox domain
SMARTSM003891.2E-192285IPR001356Homeobox domain
CDDcd000867.40E-192382No hitNo description
PfamPF000465.2E-182479IPR001356Homeobox domain
PROSITE patternPS0002705679IPR017970Homeobox, conserved site
PROSITE profilePS5084853.199214448IPR002913START domain
SuperFamilySSF559613.63E-37215446No hitNo description
CDDcd088753.42E-118218444No hitNo description
SMARTSM002342.7E-46223445IPR002913START domain
PfamPF018522.6E-48224445IPR002913START domain
Gene3DG3DSA:3.30.530.208.9E-7332443IPR023393START-like domain
SuperFamilySSF559612.09E-22466670No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009828Biological Processplant-type cell wall loosening
GO:0010091Biological Processtrichome branching
GO:0005634Cellular Componentnucleus
GO:0016021Cellular Componentintegral component of membrane
GO:0000976Molecular Functiontranscription regulatory region sequence-specific DNA binding
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 843 aa     Download sequence    Send to blast
MEYGSGGGSG SGGDHDASDP SRRKKRYHRH TAHQIQRLEA MFKECPHPDE KQRLQLSREL  60
GLAPRQIKFW FQNRRTQMKA QHERADNCAL RAENDKIRCE NIAIREALKN VICPSCGGPP  120
ATEDSYFDEQ KLRMENAQLK EELDRVSSIA AKYIGRPISQ LPPVQPIHVS SLDLTMASFG  180
GHGVGVAGPS LDLDLLPGSS STMPNLPFQP IAISDMDKSL MTDIAANAME ELLRLLQTNE  240
PLWIKSTNDG RDVLNLESYE RIFPRANTHF KSPNVRIEAS RDSGVVIMNG LALVDMFMDS  300
NKWVDLFPTI VSMAKTIEII SSGMMGSHSG SLQLMYEELQ VLSPLVPTRE FYILRFCQQI  360
EPGLWAIVNI SYDFPQFASQ CRSHRLPSGC LIQDMPNGYS KVTWMEHVEI EDKTPTHRLY  420
RDLIHSGLAF GAERWLATLQ RMCERFACLM VSGTSTRDLG GVIPSPDGKR SMMKLAQRMV  480
NNFCTSISTS NSHRWTTLSG LNEVGVRVTV HKSSDPGQPN GVVLSAATTF WLPVTPQNVF  540
NFFKDERTRP QWDVLSNSNA VQEVAHIANG SHPGNCISVL RAFNTSQNNM LILQESCIDS  600
SGSLVVYCPV DLPAINVAMS GEDPSYIPLL PSGFTISPDG HPDQGDGAST SSTVHGNMGR  660
SGGSLITVAF QILFLMIWFP YCYTTVLPLS PSSLNIEIRE IQSLFALVVF NVREVLWLVG  720
TFNALSSRAE AAYLLLAAAI AVSAAKLDYN LQQLMFFIYF YFLGGFFTRH FGFLQFTLEE  780
WLATSLKRSQ ERTIRCLSCC TSHCFCLNGF LSSLCSILSH LLLYSTLLYS TRQQEKVVWF  840
GY*
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00232DAPTransfer from AT1G73360Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007010890.10.0Homeobox-leucine zipper protein HDG11
SwissprotQ9FX310.0HDG11_ARATH; Homeobox-leucine zipper protein HDG11
TrEMBLA0A061FQZ60.0A0A061FQZ6_THECC; Homeobox-leucine zipper protein HDG11
STRINGVIT_17s0053g00780.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM30042665
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G73360.10.0homeodomain GLABROUS 11
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]